1. Introduction

Let $Y$ and $T$ be two real random variables (rv) with unknown cumulative distribution functions (df) $F$ and $G$ respectively, both assumed to be continuous. Let $X$ be a real-valued random covariable with df $V$ and continuous density $v$. Under random left-truncation (RLT), the rv of interest $Y$ is interfered with by the truncation rv $T$, in such a way that $Y$ and $T$ are observed only if $Y \ge T$. Such data occur in astronomy and economics (see Woodroofe [31], Feigelson and Babu [7], Wang et al. [30], Tsai et al. [29]) and also in epidemiology and biometry (see, e.g., He and Yang [12]).
If there were no truncation, we could think of the observations as $(X_j, Y_j, T_j)$, $1 \le j \le N$, where the sample size $N$ is deterministic but unknown. Under RLT, however, some of these vectors are missing and, for notational convenience, we denote by $(X_i, Y_i, T_i)$, $1 \le i \le n$ ($n \le N$), the observed subsequence subject to $Y_i \ge T_i$ from the $N$-sample. As a consequence of truncation, the size $n$ of the actually observed sample is a binomial rv with parameters $N$ and $\mu := \mathbb{P}(Y \ge T) > 0$. By the strong law of large numbers we have, as $N \to \infty$,

\[ \hat{\mu}_n := \frac{n}{N} \to \mu, \quad \mathbb{P}\text{-a.s.} \tag{1} \]
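The truncation mechanism and the limit in (1) can be illustrated with a small simulation. This is only a sketch under assumed laws that are not taken from the paper: $Y = 1 + \mathrm{Exp}(1)$ and $T = \mathrm{Exp}(1)$, independent, for which $\mu = \mathbb{P}(Y \ge T) = 1 - 1/(2e)$. The function name `simulate_rlt` is hypothetical.

```python
import math
import random

def simulate_rlt(N, seed=0):
    """Draw an N-sample of (Y_j, T_j) and keep only the pairs with Y_j >= T_j."""
    rng = random.Random(seed)
    observed = []
    for _ in range(N):
        y = 1.0 + rng.expovariate(1.0)  # assumed law of Y (illustration only)
        t = rng.expovariate(1.0)        # assumed law of T (illustration only)
        if y >= t:                      # the pair is observed only if Y >= T
            observed.append((y, t))
    return observed

N = 100_000
n = len(simulate_rlt(N))
ratio = n / N                            # the ratio n / N appearing in (1)
mu_true = 1.0 - 1.0 / (2.0 * math.e)     # P(Y >= T) under the assumed laws
print(n, round(ratio, 4), round(mu_true, 4))
```

In practice $N$ is unknown, so this ratio cannot be computed from the data; the simulation only shows what the observed sample size $n$ looks like relative to $N$.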
Now we consider the joint df $F(\cdot, \cdot)$ of the random vector $(X, Y)$ related to the $N$-sample and suppose it is of class $C^1$. The conditional df of $Y$ given $X = x =: (x_1, \dots, x_d)^{\top}$, that is $F(y|x) = \mathbb{E}\left[\mathbf{1}_{\{Y \le y\}} \mid X = x\right]$, may be rewritten as

\[ F(y|x) = \frac{F_1(x, y)}{v(x)} \tag{2} \]

where $F_1(x, \cdot)$ is the first derivative of $F(x, \cdot)$ with respect to $x$. For all fixed $p \in (0, 1)$, the $p$th conditional quantile of $F$ given $X = x$ is defined by

\[ q_p(x) := \inf\{y \in \mathbb{R} : F(y|x) \ge p\}. \]

It is well known that the conditional quantile function gives a good description of the data (see Chaudhuri et al. [5]) and is robust to heavy-tailed error distributions and outliers. In particular, for asymmetric distributions the conditional median function $q_{1/2}(x)$ provides a useful alternative to ordinary regression based on the mean. The nonparametric estimation of conditional quantiles was first considered in the case of complete data (no truncation). Roussas [23] showed the convergence and asymptotic normality of kernel estimates of the conditional quantile under Markov assumptions. For independent and identically distributed (iid) rv's, Stone [27] proved the weak consistency of kernel estimates. The uniform consistency was studied by Schlee [26] and Gannoun [9]. The asymptotic normality was established by Samanta [25]. Mehra et al. [20] proposed and discussed certain smooth variants (based on single as well as double kernel weights) of the standard conditional quantile estimator, proved their asymptotic normality and found an almost sure (a.s.) convergence rate, whereas Xiang [32] gave the asymptotic normality and a law of the iterated logarithm for a new kernel estimator. In the dependent case, the convergence of nonparametric quantile estimators was proved by Gannoun [10] and Boente and Fraiman.

In the RLT model, Gürler et al. [11] gave a Bahadur-type representation for the quantile function and proved its asymptotic normality. Its extension to time series analysis was obtained by Lemdani et al. [15].

The aim of this paper is to establish a strong uniform convergence rate for the kernel conditional quantile estimator with randomly left-truncated data under $\alpha$-mixing conditions, whose definition is given below. Hence, we extend the result obtained by Lemdani et al. [16] in the iid case.
First, let $\mathcal{F}_i^k(Z)$ denote the $\sigma$-field of events generated by $\{Z_j,\ i \le j \le k\}$. For easy reference, let us recall the following definition.

Definition 1.1 Let $\{Z_i,\ i \ge 1\}$ denote a sequence of random variables. Given a positive integer $n$, set

\[ \alpha(n) = \sup\left\{ |\mathbb{P}(A \cap B) - \mathbb{P}(A)\mathbb{P}(B)| : A \in \mathcal{F}_1^k(Z),\ B \in \mathcal{F}_{k+n}^{\infty}(Z),\ k \in \mathbb{N} \right\}. \]

The sequence is said to be $\alpha$-mixing (strongly mixing) if the mixing coefficient $\alpha(n) \to 0$ as $n \to \infty$.

Among the various mixing conditions used in the literature, $\alpha$-mixing is reasonably weak and has many practical applications (see, e.g., Doukhan [6] or Cai [3, 4] for more details). In particular, Masry and Tjøstheim [18] proved that both ARCH processes and nonlinear additive AR models with exogenous variables, which are particularly popular in finance and econometrics, are stationary and $\alpha$-mixing.
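As a rough illustration of a dependent but mixing sequence, the sketch below simulates a stationary Gaussian AR(1) process, a standard example of an $\alpha$-mixing sequence with geometrically decaying coefficients. The model, the coefficient $0.5$ and the function names are assumptions made for this example. The sample autocorrelation printed here is not an estimate of $\alpha(n)$; its decay is only a visible symptom of weak dependence.

```python
import math
import random

def ar1(n, phi=0.5, seed=0):
    """Stationary Gaussian AR(1): Z_{i+1} = phi * Z_i + eps_i, eps_i ~ N(0, 1)."""
    rng = random.Random(seed)
    # Start from the stationary law N(0, 1 / (1 - phi^2)).
    z = rng.gauss(0.0, 1.0 / math.sqrt(1.0 - phi * phi))
    path = []
    for _ in range(n):
        path.append(z)
        z = phi * z + rng.gauss(0.0, 1.0)
    return path

def autocorr(z, lag):
    """Sample autocorrelation of the sequence z at the given lag."""
    n = len(z)
    mean = sum(z) / n
    var = sum((v - mean) ** 2 for v in z) / n
    cov = sum((z[i] - mean) * (z[i + lag] - mean) for i in range(n - lag)) / n
    return cov / var

z = ar1(20_000)
r1 = autocorr(z, 1)   # theoretical value: phi = 0.5
r5 = autocorr(z, 5)   # theoretical value: phi**5 = 0.03125
print(round(r1, 3), round(r5, 3))
```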

The rest of the paper is organized as follows. In Section 2, we recall the definition of the kernel conditional quantile estimator with randomly left-truncated data. Assumptions and main results are given in Section 3. Section 4 is devoted to an application to prediction. Finally, the proofs of the main results are postponed to Section 5, together with some auxiliary results and their proofs.

2. Definition of the estimator

In the sequel, the letters $C$ and $C'$ are used indiscriminately as generic constants. Note also that, since $N$ is unknown and $n$ is known (although random), our results will not be stated with respect to the probability measure $\mathbb{P}$ (related to the $N$-sample) but will involve the conditional probability $\mathbf{P}$ (related to the $n$-sample). Also $\mathbb{E}$ and $\mathbf{E}$ will denote the expectation operators related to $\mathbb{P}$ and $\mathbf{P}$, respectively. Finally, we denote by a superscript $(*)$ any df that is associated to the observed sample.

The estimation of the conditional df is based on the choice of weights. For complete data, the well-known Nadaraya-Watson weights are given by

\[ W_{j,N}(x) = \frac{K\{(x - X_j)/h_N\}}{\sum_{i=1}^{N} K\{(x - X_i)/h_N\}} = \frac{(N h_N^d)^{-1} K\{(x - X_j)/h_N\}}{v_N(x)} \tag{3} \]

which are measurable functions of $x$ depending on $X_1, \dots, X_N$, with the convention $0/0 = 0$. The kernel $K$ is a measurable function on $\mathbb{R}^d$ and $(h_N)$ is a nonnegative sequence which tends to zero as $N$ tends to infinity. The regression estimator based on the $N$-sample is then given by

\[ r_N(x) = \sum_{j=1}^{N} W_{j,N}(x)\, Y_j = \frac{(N h_N^d)^{-1} \sum_{j=1}^{N} Y_j\, K\{(x - X_j)/h_N\}}{v_N(x)} \tag{4} \]

where $v_N$ is the well-known kernel estimator of $v$ based on the $N$-sample. As $N$ is unknown, $v_N(\cdot)$ cannot be calculated, and therefore neither can $r_N(\cdot)$. On the other hand, based on the $n$-sample, the kernel estimator

\[ v_n^*(x) = \frac{1}{n h_n^d} \sum_{i=1}^{n} K\left(\frac{x - X_i}{h_n}\right) \tag{5} \]

is an estimator of the conditional density $v^*(x)$ (given $Y \ge T$); see Ould-Saïd and Lemdani [21].
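The estimator (5) is an ordinary kernel density estimator applied to the observed covariates. A minimal sketch for $d = 1$ follows; the Gaussian kernel, the bandwidth $h = 0.2$ and the standard normal sample standing in for the observed $X_i$ are all assumptions made for illustration, and the name `v_star_n` is hypothetical.

```python
import math
import random

def gaussian_kernel(u):
    """Gaussian density, one possible choice of a positive bounded kernel K."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def v_star_n(x, xs, h, kernel=gaussian_kernel):
    """Kernel estimate (1 / (n h)) * sum_i K((x - X_i) / h) for d = 1."""
    return sum(kernel((x - xi) / h) for xi in xs) / (len(xs) * h)

rng = random.Random(0)
xs = [rng.gauss(0.0, 1.0) for _ in range(5000)]  # stand-in for observed covariates
estimate = v_star_n(0.0, xs, h=0.2)
target = 1.0 / math.sqrt(2.0 * math.pi)          # N(0, 1) density at 0
print(round(estimate, 3), round(target, 3))
```

The estimate carries a smoothing bias of order $h^2$ and a stochastic error that shrinks as $nh$ grows, which is why the two numbers agree only approximately.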

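Under the RLT sampling scheme, the conditional joint distribution (Stute [28]) of $(Y, T)$ becomes

\begin{align*}
J^*(y, t) &= \mathbf{P}(Y \le y, T \le t) = \mathbb{P}(Y \le y, T \le t \mid Y \ge T) \\
&= \mu^{-1} \int_{-\infty}^{y} G(t \wedge u)\, dF(u)
\end{align*}

where $t \wedge u := \min(t, u)$. The marginal distributions and their empirical versions are defined by

\[ F^*(y) = \mu^{-1} \int_{-\infty}^{y} G(u)\, dF(u), \qquad F_n^*(y) = n^{-1} \sum_{i=1}^{n} \mathbf{1}_{\{Y_i \le y\}}, \]

\[ G^*(t) = \mu^{-1} \int_{-\infty}^{\infty} G(t \wedge u)\, dF(u) \quad \text{and} \quad G_n^*(t) = n^{-1} \sum_{i=1}^{n} \mathbf{1}_{\{T_i \le t\}}, \]

where $\mathbf{1}_A$ denotes the indicator function of the set $A$.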
In the sequel we use the following consistent estimator of $\mu$:

\[ \mu_n = \frac{G_n(y)\,[1 - F_n(y-)]}{C_n(y)} \tag{6} \]

for any $y$ such that $C_n(y) \ne 0$, where $F_n(y-)$ denotes the left-limit of $F_n$ at $y$. Here $F_n$ and $G_n$ are the product-limit estimators (Lynden-Bell [17]) for $F$ and $G$, respectively, i.e.,

\[ F_n(y) = 1 - \prod_{i:\, Y_i \le y} \left[ \frac{n C_n(Y_i) - 1}{n C_n(Y_i)} \right], \qquad G_n(y) = \prod_{i:\, T_i > y} \left[ \frac{n C_n(T_i) - 1}{n C_n(T_i)} \right], \]

where $C_n(y) = n^{-1} \sum_{i=1}^{n} \mathbf{1}_{\{T_i \le y \le Y_i\}}$ is the empirical estimator of

\[ C(y) = \mathbb{P}(T \le y \le Y \mid Y \ge T). \]

He and Yang [13] proved that $\mu_n$ does not depend on $y$ and its value can then be obtained for any $y$ such that $C_n(y) \ne 0$. Furthermore, they showed (see their Corollary 2.5) its $\mathbb{P}$-a.s. consistency.
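The product-limit estimators and the estimator (6) can be sketched as follows. This is a naive $O(n^2)$ illustration, not an optimized implementation. The simulated laws ($Y = 1 + \mathrm{Exp}(1)$, $T = \mathrm{Exp}(1)$, so that $a_G < a_F$ and $\mu = 1 - 1/(2e)$) and all function names are assumptions made for the example. Evaluating $\mu_n$ at two different points shows numerically that its value does not depend on $y$.

```python
import math
import random

def simulate_truncated(N, seed=1):
    """Observed (Y_i, T_i) pairs, i.e. those with Y_i >= T_i (assumed laws)."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(N):
        y = 1.0 + rng.expovariate(1.0)  # Y = 1 + Exp(1), so a_F = 1
        t = rng.expovariate(1.0)        # T = Exp(1), so a_G = 0 < a_F
        if y >= t:
            pairs.append((y, t))
    return pairs

def risk_count(u, pairs):
    """n * C_n(u) = #{i : T_i <= u <= Y_i}."""
    return sum(1 for y, t in pairs if t <= u <= y)

def make_estimators(pairs):
    n = len(pairs)
    # Each count is >= 1 because pair i is always at risk at Y_i and at T_i.
    at_y = [(y, risk_count(y, pairs)) for y, _ in pairs]
    at_t = [(t, risk_count(t, pairs)) for _, t in pairs]

    def F_left(u):
        """F_n(u-): one minus the product over all Y_i < u."""
        prod = 1.0
        for y, c in at_y:
            if y < u:
                prod *= (c - 1) / c
        return 1.0 - prod

    def G(u):
        """G_n(u): product over all T_i > u."""
        prod = 1.0
        for t, c in at_t:
            if t > u:
                prod *= (c - 1) / c
        return prod

    def mu(u):
        """mu_n computed at the point u; requires C_n(u) > 0."""
        return G(u) * (1.0 - F_left(u)) * n / risk_count(u, pairs)

    return F_left, G, mu

pairs = simulate_truncated(1000)
F_left, G, mu = make_estimators(pairs)
ys = sorted(y for y, _ in pairs)
mu_a = mu(ys[len(ys) // 4])               # mu_n evaluated at one observed Y
mu_b = mu(ys[len(ys) // 2])               # mu_n evaluated at another observed Y
mu_true = 1.0 - 1.0 / (2.0 * math.e)      # P(Y >= T) under the assumed laws
print(mu_a, mu_b, mu_true)
```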

Suppose now that one observes the $n$ triplets $(X_i, Y_i, T_i)$ among the $N$ ones and, for any df $L$, denote the left and right endpoints of its support by $a_L := \inf\{x : L(x) > 0\}$ and $b_L := \sup\{x : L(x) < 1\}$, respectively. Then under the current model, as discussed by Woodroofe [31], $F$ and $G$ can be estimated completely only if

\[ a_G \le a_F, \quad b_G \le b_F \quad \text{and} \quad \int_{a_F}^{\infty} \frac{dF}{G} < \infty. \]

In order to estimate the marginal density $v$ we have to take the truncation into account, and the estimator

\[ v_n(x) = \frac{\mu_n}{n h_n^d} \sum_{i=1}^{n} \frac{1}{G_n(Y_i)}\, K\left(\frac{x - X_i}{h_n}\right) \tag{7} \]

is considered in Ould-Saïd and Lemdani [21]. Note that in this formula and the forthcoming ones, the sum is taken only over those $i$ such that $G_n(Y_i) \ne 0$.
Then, adapting the weights of Ould-Saïd and Lemdani, we get the following estimator of the conditional df of $Y$ given $X = x$:

\[ F_n(y|x) = \frac{\sum_{i=1}^{n} G_n^{-1}(Y_i)\, K\left(\frac{x - X_i}{h_n}\right) H\left(\frac{y - Y_i}{h_n}\right)}{\sum_{i=1}^{n} G_n^{-1}(Y_i)\, K\left(\frac{x - X_i}{h_n}\right)} = \frac{F_{1,n}(x, y)}{v_n(x)} \tag{8} \]

where $H$ is a distribution function defined on $\mathbb{R}$, and

\[ F_{1,n}(x, y) = \frac{\mu_n}{n h_n^d} \sum_{i=1}^{n} \frac{1}{G_n(Y_i)}\, K\left(\frac{x - X_i}{h_n}\right) H\left(\frac{y - Y_i}{h_n}\right) \tag{9} \]

is an estimator of $F_1(x, y)$. As the latter is continuous, it is better to define a smooth estimator by using a continuous function $H(\cdot)$ instead of a step function $\mathbf{1}_{\{\cdot\}}$. We point out that the estimators (8) and (9) have already been defined in Lemdani et al. [16].
Then a natural estimator of the $p$th conditional quantile $q_p(x)$ is given by

\[ q_{p,n}(x) := \inf\{y \in \mathbb{R} : F_n(y|x) \ge p\}, \tag{10} \]

which satisfies $F_n(q_{p,n}(x)|x) = p$.
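The construction of the conditional df estimator and of the quantile estimator can be sketched end to end for $d = 1$. Everything specific here is an assumption made for illustration: the regression model $Y = 2 + X + \varepsilon$ with $X \sim U(0,1)$ and Gaussian noise, the truncation law $T \sim \mathrm{Exp}(1)$, the Gaussian kernel for $K$, the integrated Epanechnikov density for $H$, the bandwidth $h = 0.15$, and all function names. The factor $\mu_n$ cancels in the ratio $F_{1,n}/v_n$, so it is not computed.

```python
import math
import random

def K(u):
    """Gaussian kernel (assumed choice)."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def H(u):
    """df of the Epanechnikov density, a smooth df with compact-support density."""
    if u <= -1.0:
        return 0.0
    if u >= 1.0:
        return 1.0
    return 0.5 + 0.75 * u - 0.25 * u ** 3

def simulate(N, seed=3):
    """Observed triplets (X_i, Y_i, T_i) with Y_i >= T_i (assumed model)."""
    rng = random.Random(seed)
    data = []
    for _ in range(N):
        x = rng.random()
        y = 2.0 + x + rng.gauss(0.0, 0.25)  # conditional median of Y is 2 + x
        t = rng.expovariate(1.0)
        if y >= t:
            data.append((x, y, t))
    return data

def G_n_at(ys, ts):
    """Lynden-Bell estimate G_n(Y_i) for every observed Y_i (naive O(n^2))."""
    n_c = [sum(1 for y, t in zip(ys, ts) if t <= s <= y) for s in ts]
    out = []
    for y in ys:
        prod = 1.0
        for t, c in zip(ts, n_c):
            if t > y:
                prod *= (c - 1) / c
        out.append(prod)
    return out

def cond_df(y, x, data, g, h):
    """Ratio of weighted sums; terms with G_n(Y_i) = 0 are skipped."""
    num = den = 0.0
    for (xi, yi, _), gi in zip(data, g):
        if gi > 0.0:
            w = K((x - xi) / h) / gi
            num += w * H((y - yi) / h)
            den += w
    return num / den if den > 0.0 else 0.0

def cond_quantile(p, x, data, g, h, tol=1e-6):
    """Bisection for inf{y : F_n(y|x) >= p}; F_n(.|x) is continuous, nondecreasing."""
    lo = min(d[1] for d in data) - h   # F_n(lo|x) = 0
    hi = max(d[1] for d in data) + h   # F_n(hi|x) = 1
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if cond_df(mid, x, data, g, h) >= p:
            hi = mid
        else:
            lo = mid
    return hi

data = simulate(800)
g = G_n_at([d[1] for d in data], [d[2] for d in data])
q_med = cond_quantile(0.5, 0.5, data, g, h=0.15)
print(round(q_med, 3))  # compare with the true conditional median 2.5 at x = 0.5
```

Bisection is used instead of a grid search because $F_n(\cdot|x)$ is continuous and nondecreasing when $H$ is a continuous df, so the infimum in the definition of $q_{p,n}(x)$ is attained at the crossing point.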



3. Assumptions and main results 

In what follows, we focus our attention on the case of a univariate covariable ($d = 1$) and denote the covariable by $X$ and the kernel by $K$. Assume that $0 = a_G < a_F$ and $b_G \le b_F$. We consider two real numbers $a$ and $b$ such that $a_F < a < b < b_F$. Let $\Omega$ be a compact subset of $\Omega_0 = \{x \in \mathbb{R} \mid v(x) > 0\}$ and $\gamma := \inf_{x \in \Omega} v(x) > 0$.

We introduce some assumptions needed to state our results, gathered below for easy reference.

(K1) $K$ is a positive-valued, bounded probability density, Hölder continuous with exponent $\beta > 0$ and satisfying

\[ |u| K(u) \to 0 \quad \text{as } |u| \to +\infty. \]

(K2) $H$ is a df with $C^1$-probability density $H^{(1)}$ which is positive, bounded and has compact support. It is also Hölder continuous with exponent $\beta$.

(K3) i) $H^{(1)}$ and $K$ are second-order kernels,
ii) $\int K^2(r)\, dr < \infty$.

(M1) $\{(X_i, Y_i);\ i \ge 1\}$ is a sequence of stationary $\alpha$-mixing random variables with coefficient $\alpha(n)$.

(M2) $\{T_i;\ i \ge 1\}$ is a sequence of iid truncating variables independent of $\{(X_i, Y_i),\ i \ge 1\}$ with common continuous df $G$.

(M3) There exists $\nu > 5 + 1/\beta$ for some $\beta > 1/7$ such that

\[ \forall n, \quad \alpha(n) = O(n^{-\nu}). \]

(D1) The conditional density $v^*(\cdot)$ is twice continuously differentiable.

(D2) The joint conditional density $v^*(\cdot, \cdot)$ of $(X_i, X_j)$ exists and satisfies

\[ \sup_{r, s} |v^*(r, s) - v^*(r) v^*(s)| \le C < \infty, \]

for some constant $C$ not depending on $(i, j)$.

(D3) The joint conditional density of $(X_i, Y_i, X_j, Y_j)$, denoted by $f^*(\cdot, \cdot, \cdot, \cdot)$, exists and satisfies, for some constant $C$,

\[ \sup_{r, s, t, u} |f^*(r, s, t, u) - f^*(r, s) f^*(t, u)| \le C < \infty. \]

(D4) The joint density $f(\cdot, \cdot)$ is bounded and twice continuously differentiable.

(D5) The marginal density $v(\cdot)$ is locally Lipschitz continuous over $\Omega_0$.

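The bandwidth $h_n =: h$ satisfies:

(H1) $h \downarrow 0$, $\dfrac{\log n}{nh} \to 0$ and $h = o(1/\log n)$, as $n \to \infty$,

(H2) $C\, n^{\frac{(3 - \nu)\beta}{\beta(\nu + 1) + 4\beta + 1} + \eta} < h < C'\, n^{\frac{1}{1 - \nu}}$, where $\eta$ satisfies

\[ \frac{2}{\beta(\nu + 1) + 4\beta + 1} < \eta < \frac{(\nu - 3)\beta}{\beta(\nu + 1) + 4\beta + 1} + \frac{1}{1 - \nu}, \]

and $\nu$ and $\beta$ are the same as in (M3).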

Remark 3.1 Assumptions (K) are quite usual in kernel estimation. Conditions (D1), (D4) and (D5) are needed in the study of the bias term. (D2) and (D3) are needed for the covariance calculations and take forms similar to those used under mixing. Hypothesis (H2) is used in Ould-Saïd and Tatachak [22] and is needed to establish Lemma 5.1 and Lemma 5.4. Assumptions (M) concern the mixing structure of the processes and are standard in such situations. The choice of $\beta$ may seem surprising, but it is only a technical choice which permits us to make one of the variance terms negligible.

Remark 3.2 Here we point out that we cannot simply suppose that the original data (that is, the $N$-sample) satisfy some kind of dependence condition. Indeed, we would not know whether the observed data are then $\alpha$-mixing and, if they are, we would not know the mixing coefficient. Therefore, we suppose directly that the observed data satisfy a mixing condition.

Remark 3.3 As we are interested in the number $n$ of observations ($N$ is unknown), we give asymptotics as $n \to \infty$ unless otherwise specified. Since $n \le N$, this implies $N \to \infty$ and these results also hold $\mathbf{P}$-a.s. as $N \to \infty$.



Our first result, stated in Proposition 3.1, is the uniform almost sure convergence with rate of the conditional df estimator defined in (8).

Proposition 3.1 Under assumptions (K), (M), (D) and (H), we have

\[ \sup_{x \in \Omega} \sup_{a \le y \le b} |F_n(y|x) - F(y|x)| = O\left( \max\left\{ \sqrt{\frac{\log n}{nh}},\ h^2 \right\} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]
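To get a feeling for the rate $\max\{\sqrt{\log n/(nh)},\, h^2\}$, the sketch below evaluates it numerically and computes the bandwidth that balances the two terms, namely $h = (\log n / n)^{1/5}$, which gives a rate of order $(\log n / n)^{2/5}$. This balancing bandwidth is only an illustration: it satisfies (H1), but whether it also satisfies (H2) depends on $\nu$ and $\beta$ and is not checked here. The function names are hypothetical.

```python
import math

def rate(n, h):
    """The bound max{ sqrt(log n / (n h)), h^2 } (up to a constant)."""
    return max(math.sqrt(math.log(n) / (n * h)), h * h)

def balancing_bandwidth(n):
    """Bandwidth equalizing the two terms: h^5 = log n / n."""
    return (math.log(n) / n) ** 0.2

n = 10_000
h_star = balancing_bandwidth(n)
stochastic_term = math.sqrt(math.log(n) / (n * h_star))
bias_term = h_star ** 2
print(h_star, stochastic_term, bias_term)
```

Taking a larger bandwidth inflates the $h^2$ term and taking a smaller one inflates the stochastic term, so in this simplified view the balancing choice minimizes the bound.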



The second result deals with the strong uniform convergence with rate of the kernel conditional quantile estimator $q_{p,n}(\cdot)$, which is given in the following theorem.

Theorem 3.1 Under the assumptions of Proposition 3.1 and for each fixed $p \in (0, 1)$, if the function $q_p$ satisfies the following condition: for given $\varepsilon > 0$ there exists $\beta > 0$ such that

\[ \forall \eta_p : \Omega \to \mathbb{R}, \quad \sup_{x \in \Omega} |q_p(x) - \eta_p(x)| \ge \varepsilon \ \Rightarrow\ \sup_{x \in \Omega} |F(q_p(x)|x) - F(\eta_p(x)|x)| \ge \beta, \tag{11} \]

then we have

\[ \lim_{n \to \infty} \sup_{x \in \Omega} |q_{p,n}(x) - q_p(x)| = 0, \quad \mathbf{P}\text{-a.s.} \]

Furthermore, we have

\[ \sup_{x \in \Omega} |q_{p,n}(x) - q_p(x)| = O\left( \max\left\{ \sqrt{\frac{\log n}{nh}},\ h^2 \right\} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]



4. Applications to prediction 

It is well known from robustness theory that the median is more robust than the mean; therefore the conditional median $\mu(x) = q_{1/2}(x)$ is a good alternative to the conditional mean as a predictor for a variable $Y$ given $X = x$. Note that the estimator of $\mu(x)$ is given by $\mu_n(x) = q_{1/2,n}(x)$. Using these considerations and Section 2, we want to predict the non-observed rv $Y_{n+1}$ (which corresponds to some modality of our problem) from the available data $X_1, \dots, X_n$. Given a new value $X_{n+1}$, we can predict the corresponding response $Y_{n+1}$ by

\[ \hat{Y}_{n+1} = \mu_n(X_{n+1}) = q_{1/2,n}(X_{n+1}). \]

Needless to say, the theoretical predictor is given by $\mu(X_{n+1}) = q_{1/2}(X_{n+1})$.

Applying the above theorem, we have the following corollary:

Corollary 4.1 Under the assumptions of Theorem 3.1, we have

\[ |q_{1/2,n}(X_{n+1}) - q_{1/2}(X_{n+1})| \to 0, \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]

5. Proofs



We need some auxiliary results and notations to prove our results. The first lemma gives the uniform convergence with rate of the estimator $v_n^*(x)$ defined in (5).

Lemma 5.1 Under (K1), (K3), (M), (D1), (D2) and (H) we have

\[ \sup_{x \in \Omega} |v_n^*(x) - v^*(x)| = O\left( \max\left\{ \sqrt{\frac{\log n}{nh}},\ h^2 \right\} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]



imsart-generic ver. 2008/01/24 file: ejs_2008_306.tex date: October 7, 2008 



Ould-Said, Yahia & Necir /Conditional quantile for truncated dependent data 10 



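Proof. We have

\begin{align*}
\sup_{x \in \Omega} |v_n^*(x) - v^*(x)| &\le \sup_{x \in \Omega} |v_n^*(x) - \mathbf{E}[v_n^*(x)]| + \sup_{x \in \Omega} |\mathbf{E}[v_n^*(x)] - v^*(x)| \\
&=: \mathcal{I}_{1n} + \mathcal{I}_{2n}. \tag{12}
\end{align*}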



We begin by studying the variability term $\mathcal{I}_{1n}$. The idea consists in using an exponential inequality taking into account the $\alpha$-mixing structure. The compact set $\Omega$ can be covered by a finite number $l_n$ of intervals of length $\omega_n = (n^{-1} h^{1 + 2\beta})^{\frac{1}{2\beta}}$, where $\beta$ is the Hölder exponent. Let $I_k := I(x_k, \omega_n)$, $k = 1, \dots, l_n$, denote each interval centered at some point $x_k$. Since $\Omega$ is bounded, there exists a constant $C$ such that $\omega_n l_n \le C$. For any $x$ in $\Omega$, there exists $I_k$ which contains $x$ such that $|x - x_k| \le \omega_n$. We start by writing

\[ \Delta_i(x) = \frac{1}{nh} \left\{ K\left(\frac{x - X_i}{h}\right) - \mathbf{E}\left[ K\left(\frac{x - X_1}{h}\right) \right] \right\}. \]



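Clearly we have

\begin{align*}
\sum_{i=1}^{n} \Delta_i(x) &= \left\{ (v_n^*(x) - v_n^*(x_k)) - (\mathbf{E}[v_n^*(x)] - \mathbf{E}[v_n^*(x_k)]) \right\} + (v_n^*(x_k) - \mathbf{E}[v_n^*(x_k)]) \\
&=: \sum_{i=1}^{n} \tilde{\Delta}_i(x) + \sum_{i=1}^{n} \Delta_i(x_k).
\end{align*}

Hence

\begin{align*}
\sup_{x \in \Omega} \left| \sum_{i=1}^{n} \Delta_i(x) \right| &\le \max_{1 \le k \le l_n} \sup_{x \in I_k} \left| \sum_{i=1}^{n} \tilde{\Delta}_i(x) \right| + \max_{1 \le k \le l_n} \left| \sum_{i=1}^{n} \Delta_i(x_k) \right| \\
&=: S_{1n} + S_{2n}. \tag{13}
\end{align*}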



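Firstly, we have under assumption (K1),

\begin{align*}
\sup_{x \in I_k} \left| \sum_{i=1}^{n} \tilde{\Delta}_i(x) \right| &\le \frac{1}{nh} \sum_{i=1}^{n} \sup_{x \in I_k} \left| K\left(\frac{x - X_i}{h}\right) - K\left(\frac{x_k - X_i}{h}\right) \right| + \frac{1}{h} \sup_{x \in I_k} \mathbf{E} \left| K\left(\frac{x - X_1}{h}\right) - K\left(\frac{x_k - X_1}{h}\right) \right| \\
&\le \frac{2C}{h} \sup_{x \in I_k} \frac{|x - x_k|^{\beta}}{h^{\beta}} \\
&\le C \omega_n^{\beta} h^{-1-\beta} = O\left( (nh)^{-1/2} \right). \tag{14}
\end{align*}

Hence, by (H1) and for $n$ large enough, we get $S_{1n} = o_{\mathbf{P}}(1)$; more precisely, (14) shows that $S_{1n}$ is negligible with respect to $\sqrt{\log n/(nh)}$.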
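We now turn to the term $S_{2n}$ in (13). Under (K1), the rv's $U_i = nh \Delta_i(x_k)$ are centered and bounded. The use of the well-known Fuk-Nagaev inequality (see Rio [23, formula 6.19b, page 87]), slightly modified in Ferraty and Vieu [8, see Proposition A.11-ii), page 237], allows one to get, for all $\varepsilon > 0$ and $r > 1$,

\begin{align*}
\mathbf{P}\left\{ \max_{1 \le k \le l_n} \left| \sum_{i=1}^{n} \Delta_i(x_k) \right| > \varepsilon \right\} &\le \sum_{k=1}^{l_n} \mathbf{P}\left\{ \left| \sum_{i=1}^{n} U_i \right| > nh\varepsilon \right\} \\
&\le C \omega_n^{-1} \left\{ \frac{n}{r} \left( \frac{r}{\varepsilon nh} \right)^{\nu + 1} + \left( 1 + \frac{\varepsilon^2 n^2 h^2}{r s_n^2} \right)^{-\frac{r}{2}} \right\} \\
&=: Q_{1n} + Q_{2n} \tag{15}
\end{align*}

where

\[ s_n^2 = \sum_{1 \le i \le n} \sum_{1 \le j \le n} |\mathrm{Cov}(U_i, U_j)|. \]

Put

\[ r = (\log n)^{1 + \delta}, \ \text{where } \delta > 0, \quad \text{and} \quad \varepsilon = \varepsilon_0 \sqrt{\frac{\log n}{nh}} \ \text{for some } \varepsilon_0 > 0. \tag{16} \]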



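We have

\begin{align*}
Q_{1n} &= C \omega_n^{-1}\, \frac{n}{(\log n)^{1 + \delta}} \left( \frac{(\log n)^{1 + \delta}}{\varepsilon_0 \sqrt{nh \log n}} \right)^{\nu + 1} \\
&= C\, n^{1 - \frac{\nu + 1}{2} + \frac{1}{2\beta}}\, h^{-\left( \frac{\nu + 1}{2} + 1 + \frac{1}{2\beta} \right)} (\log n)^{(1 + \delta)\nu - \frac{\nu + 1}{2}}\, \varepsilon_0^{-(\nu + 1)}.
\end{align*}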

Note that under (M3), it is easy to see that the following modified version (H'2) of assumption (H2) holds:

\[ C\, n^{\frac{(3 - \nu)\beta}{\beta(\nu + 1) + 2\beta + 1} + \eta} < h < C'\, n^{\frac{1}{1 - \nu}}, \tag{H'2} \]

where $\eta$ satisfies

\[ \frac{1}{\beta(\nu + 1) + 2\beta + 1} < \eta < \frac{(\nu - 3)\beta}{\beta(\nu + 1) + 2\beta + 1} + \frac{1}{1 - \nu}, \tag{17} \]

and $\nu$ and $\beta$ are the same as in (M3).
Then, from the left-hand side of (H'2),

\[ Q_{1n} \le C (\log n)^{(1 + \delta)\nu - \frac{\nu + 1}{2}}\, n^{-1 - \frac{\eta(\beta(\nu + 1) + 2\beta + 1) - 1}{2\beta}}. \]

Hence, for any $\eta$ as in (17), $Q_{1n}$ is bounded by the general term of a convergent series.
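Before we focus on $Q_{2n}$, we have to study the asymptotic behavior of

\[ s_n^2 = \sum_{i=1}^{n} \mathrm{Var}(U_i) + \sum_{i \ne j} |\mathrm{Cov}(U_i, U_j)| =: s_n^{var} + s_n^{cov}. \]

First, by (K3: ii), (D1) and a change of variable, we obtain

\[ s_n^{var} = n \mathrm{Var}(U_1) = n \left\{ \mathbf{E}\left[ K^2\left(\frac{x_k - X_1}{h}\right) \right] - \mathbf{E}^2\left[ K\left(\frac{x_k - X_1}{h}\right) \right] \right\} = O(nh). \tag{18} \]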
On the other hand, a change of variable, (K1), (M1) and (D2) lead to

\begin{align*}
|\mathrm{Cov}(U_i, U_j)| &= |\mathbf{E}[U_i U_j]| \\
&\le \int\!\!\int K\left(\frac{x_k - r}{h}\right) K\left(\frac{x_k - s}{h}\right) |v^*(r, s) - v^*(r) v^*(s)|\, dr\, ds \\
&= O(h^2). \tag{19}
\end{align*}

Note also that these covariances can be controlled by means of the usual Davydov covariance inequality for mixing processes (see Rio [23, formula 1.12a, page 10] or Bosq [2, formula 1.11, page 22]). We have

\[ \forall i \ne j, \quad |\mathrm{Cov}(U_i, U_j)| \le C \alpha(|i - j|). \tag{20} \]
To evaluate $s_n^{cov}$, we use the technique developed by Masry [18]. Taking a suitable integer sequence $\varphi_n$, defined through $\lceil \cdot \rceil$ (the smallest integer greater than the argument), we can write

\[ s_n^{cov} = \sum_{0 < |i - j| \le \varphi_n} |\mathrm{Cov}(U_i, U_j)| + \sum_{|i - j| > \varphi_n} |\mathrm{Cov}(U_i, U_j)|. \tag{21} \]

First, applying the upper bound (19) to the first covariance term in (21), we get

\[ \sum_{0 < |i - j| \le \varphi_n} |\mathrm{Cov}(U_i, U_j)| \le C n h^2 \varphi_n. \tag{22} \]

For the second term, thanks to (20) we get

\begin{align*}
\sum_{|i - j| > \varphi_n} |\mathrm{Cov}(U_i, U_j)| &\le C \sum_{|i - j| > \varphi_n} \alpha(|i - j|) \\
&\le C n^2 \alpha(\varphi_n). \tag{23}
\end{align*}

According to the right-hand side of (H'2), using (M3), (22) and (23), we get

\[ s_n^{cov} = O(nh). \tag{24} \]

Finally, (18) and (24) lead directly to $s_n^2 = O(nh)$.

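This is enough to study the quantity $Q_{2n}$: for $\varepsilon$ and $r$ as in (16), a Taylor expansion of $\log(1 + x)$ allows us to write

\[ Q_{2n} = C \omega_n^{-1} \exp\left\{ -\frac{r}{2} \log\left( 1 + \frac{\varepsilon_0^2\, nh \log n}{r s_n^2} \right) \right\} \le C\, n^{\frac{1}{2\beta} - C' \varepsilon_0^2}\, h^{-\left( 1 + \frac{1}{2\beta} \right)}. \]

By using (H'2) and (M3), the latter can be made the general term of a convergent series. Hence $\sum_{n \ge 1} (Q_{1n} + Q_{2n}) < \infty$, and therefore, by the Borel-Cantelli lemma, we have

\[ \mathcal{I}_{1n} = O\left( \sqrt{\frac{\log n}{nh}} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]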

On the other hand, the bias term $\mathcal{I}_{2n}$ does not depend on the mixing structure. We prove its convergence by using a change of variable and a Taylor expansion (see Lemma 6.1 in Lemdani et al. [15]). We get, under (K3) and (D1),

\[ \mathcal{I}_{2n} = O(h^2), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]

Hence, replacing $\mathcal{I}_{1n}$ and $\mathcal{I}_{2n}$ in (12), we get the result. ■
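For the reader's convenience, the change of variable and Taylor expansion behind the bias bound can be written out as follows. This is a sketch under the assumption that "second-order kernel" in (K3) means $\int K(r)\,dr = 1$, $\int r K(r)\,dr = 0$ and $\int r^2 K(r)\,dr < \infty$; the intermediate point $\tilde{x}_r$, lying between $x - rh$ and $x$, is notation introduced here.

```latex
\begin{align*}
\mathbf{E}[v_n^*(x)] - v^*(x)
  &= \frac{1}{h}\int K\!\left(\frac{x-u}{h}\right) v^*(u)\,du - v^*(x)
   = \int K(r)\,\bigl[v^*(x-rh) - v^*(x)\bigr]\,dr \\
  &= -h\,(v^*)'(x)\int r\,K(r)\,dr
     + \frac{h^2}{2}\int r^2 K(r)\,(v^*)''(\tilde{x}_r)\,dr \\
  &= \frac{h^2}{2}\int r^2 K(r)\,(v^*)''(\tilde{x}_r)\,dr .
\end{align*}
```

Under (D1), $(v^*)''$ is continuous and hence bounded on a compact neighbourhood of $\Omega$, so the last integral is bounded uniformly in $x \in \Omega$ for a kernel with sufficiently light tails, which gives the order $h^2$.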

The following lemma is Lemma 4.2 in Ould-Saïd and Tatachak [22], in which they state a rate of convergence for $\mu_n$ under the $\alpha$-mixing hypothesis. This rate is interesting in itself and is similar to that established in the iid case by He and Yang [13].

Lemma 5.2 Under assumptions (M), we have

\[ |\mu_n - \mu| = O\left( \sqrt{\frac{\log \log n}{n}} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]

Proof. See Lemma 4.2 in Ould-Saïd and Tatachak [22].
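Adapting (9), define

\[ \tilde{F}_{1,n}(x, y) = \frac{\mu}{nh} \sum_{i=1}^{n} \frac{1}{G(Y_i)}\, K\left(\frac{x - X_i}{h}\right) H\left(\frac{y - Y_i}{h}\right). \tag{25} \]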



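Lemma 5.3 Under the assumptions of Lemma 5.1 and (K2), we have

\[ \sup_{x \in \Omega} \sup_{a \le y \le b} \left| F_{1,n}(x, y) - \tilde{F}_{1,n}(x, y) \right| = O\left( \sqrt{\frac{\log \log n}{n}} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]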



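Proof. Under (K2), the df $H$ is bounded by 1. Hence

\[ \left| F_{1,n}(x, y) - \tilde{F}_{1,n}(x, y) \right| \le \left\{ \frac{|\mu_n - \mu|}{G_n(a_F)} + \frac{\mu \sup_{y \ge a_F} |G_n(y) - G(y)|}{G_n(a_F)\, G(a_F)} \right\} v_n^*(x). \]

From Lemma 5.2, we have

\[ |\mu_n - \mu| = O\left( \sqrt{\frac{\log \log n}{n}} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]

Moreover, $G_n(a_F) \to G(a_F) > 0$, $\mathbf{P}$-a.s. as $n \to \infty$. In the same way and using Remark 6 in Woodroofe [31] we get

\[ \sup_{y \ge a_F} |G_n(y) - G(y)| = O(n^{-1/2}), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]

Combining these last results with Lemma 5.1, we achieve the proof. ■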



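Lemma 5.4 Under assumptions (K), (M), (D3), (D4) and (H), we have

\[ \sup_{x \in \Omega} \sup_{a \le y \le b} \left| \tilde{F}_{1,n}(x, y) - \mathbf{E}\left[ \tilde{F}_{1,n}(x, y) \right] \right| = O\left( \sqrt{\frac{\log n}{nh}} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]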
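Proof. The proof is analogous to that of Lemma 5.1. We give only the leading lines. As $\Omega$ and $[a, b]$ are compact sets, they can be covered by finite numbers $l_n$ and $d_n$ of intervals $I_1, \dots, I_{l_n}$ and $J_1, \dots, J_{d_n}$ of lengths $\omega_n$ (as in Lemma 5.1) and $\lambda_n$, with centers $x_1, \dots, x_{l_n}$ and $y_1, \dots, y_{d_n}$ respectively. Since $\Omega$ and $[a, b]$ are bounded, there exist two constants $C_1$ and $C_2$ such that $l_n \omega_n \le C_1$ and $d_n \lambda_n \le C_2$. Hence for any $(x, y) \in \Omega \times [a, b]$, there exist $x_k$ and $y_j$ such that $|x - x_k| \le \omega_n$ and $|y - y_j| \le \lambda_n$. Thus we have the following decomposition:

\begin{align*}
\sup_{x \in \Omega} \sup_{y \in [a, b]} \left| \tilde{F}_{1,n}(x, y) - \mathbf{E}\tilde{F}_{1,n}(x, y) \right|
&\le \max_{1 \le k \le l_n} \sup_{x \in I_k} \sup_{y} \left| \tilde{F}_{1,n}(x, y) - \tilde{F}_{1,n}(x_k, y) \right| \\
&\quad + \max_{1 \le k \le l_n} \max_{1 \le j \le d_n} \sup_{y \in J_j} \left| \tilde{F}_{1,n}(x_k, y) - \tilde{F}_{1,n}(x_k, y_j) \right| \\
&\quad + \max_{1 \le k \le l_n} \max_{1 \le j \le d_n} \left| \tilde{F}_{1,n}(x_k, y_j) - \mathbf{E}\tilde{F}_{1,n}(x_k, y_j) \right| \\
&\quad + \max_{1 \le k \le l_n} \max_{1 \le j \le d_n} \sup_{y \in J_j} \left| \mathbf{E}\tilde{F}_{1,n}(x_k, y_j) - \mathbf{E}\tilde{F}_{1,n}(x_k, y) \right| \\
&\quad + \max_{1 \le k \le l_n} \sup_{x \in I_k} \sup_{y} \left| \mathbf{E}\tilde{F}_{1,n}(x_k, y) - \mathbf{E}\tilde{F}_{1,n}(x, y) \right| \\
&=: \mathcal{J}_{1n} + \mathcal{J}_{2n} + \mathcal{J}_{3n} + \mathcal{J}_{4n} + \mathcal{J}_{5n}.
\end{align*}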

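Firstly, concerning $\mathcal{J}_{1n}$ and $\mathcal{J}_{5n}$, assumptions (K1) and (K2) yield

\[ \sup_{x \in I_k} \sup_{y} \left| \tilde{F}_{1,n}(x, y) - \tilde{F}_{1,n}(x_k, y) \right| \le \frac{C \omega_n^{\beta}}{G(a_F)\, h^{1 + \beta}} \cdot \frac{1}{n} \sum_{i=1}^{n} H\left(\frac{y - Y_i}{h}\right) = O\left( (nh)^{-1/2} \right). \]

Hence, by (H1) we get

\[ \sqrt{\frac{nh}{\log n}}\ \sup_{x \in \Omega} \sup_{y \in [a, b]} \left| \tilde{F}_{1,n}(x, y) - \tilde{F}_{1,n}(x_k, y) \right| = o(1). \tag{26} \]

Similarly, we obtain for $\mathcal{J}_{2n}$ and $\mathcal{J}_{4n}$

\[ \sup_{y \in J_j} \left| \tilde{F}_{1,n}(x_k, y) - \tilde{F}_{1,n}(x_k, y_j) \right| \le \frac{C \lambda_n^{\beta}}{G(a_F)\, h^{1 + \beta}} \cdot \frac{1}{n} \sum_{i=1}^{n} K\left(\frac{x_k - X_i}{h}\right) = O\left( (nh)^{-1/2} \right). \]

Again, by (H1) we get

\[ \sqrt{\frac{nh}{\log n}}\ \sup_{x \in \Omega} \sup_{y \in [a, b]} \left| \tilde{F}_{1,n}(x_k, y) - \tilde{F}_{1,n}(x_k, y_j) \right| = o(1). \tag{27} \]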
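As to $\mathcal{J}_{3n}$, for all $\varepsilon > 0$ we have

\begin{align*}
&\mathbf{P}\left\{ \max_{1 \le k \le l_n} \max_{1 \le j \le d_n} \left| \tilde{F}_{1,n}(x_k, y_j) - \mathbf{E}\tilde{F}_{1,n}(x_k, y_j) \right| > \varepsilon \right\} \\
&\qquad \le l_n d_n\, \mathbf{P}\left\{ \left| \tilde{F}_{1,n}(x_k, y_j) - \mathbf{E}\tilde{F}_{1,n}(x_k, y_j) \right| > \varepsilon \right\}. \tag{28}
\end{align*}

Set, for any $i \ge 1$,

\[ \Psi_i(x_k, y_j) := \frac{\mu}{nh} \left\{ \frac{1}{G(Y_i)} K\left(\frac{x_k - X_i}{h}\right) H\left(\frac{y_j - Y_i}{h}\right) - \mathbf{E}\left[ \frac{1}{G(Y_1)} K\left(\frac{x_k - X_1}{h}\right) H\left(\frac{y_j - Y_1}{h}\right) \right] \right\}. \]

Under (K1) and (K2), the rv's $V_i := nh \Psi_i(x_k, y_j)$ are centered and bounded by a constant $C < \infty$ depending only on the bounds of $K$ and $H$ and on $G(a_F)$. Then, applying again the Fuk-Nagaev inequality, we obtain that, for all $\varepsilon > 0$ and $r > 1$,

\begin{align*}
\mathbf{P}\left\{ \max_{1 \le k \le l_n} \max_{1 \le j \le d_n} \left| \sum_{i=1}^{n} \Psi_i(x_k, y_j) \right| > \varepsilon \right\}
&= \mathbf{P}\left\{ \max_{1 \le k \le l_n} \max_{1 \le j \le d_n} \left| \sum_{i=1}^{n} V_i \right| > nh\varepsilon \right\} \\
&\le C\, l_n d_n \left\{ \frac{n}{r} \left( \frac{r}{\varepsilon nh} \right)^{\nu + 1} + \left( 1 + \frac{\varepsilon^2 n^2 h^2}{r s_n^2} \right)^{-\frac{r}{2}} \right\} \\
&=: \mathcal{J}_{31n} + \mathcal{J}_{32n} \tag{29}
\end{align*}

where $s_n^2 = \sum_{1 \le i \le n} \sum_{1 \le j \le n} |\mathrm{Cov}(V_i, V_j)|$.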

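By taking $\varepsilon$ and $r$ as in (16), we get

\[ \mathcal{J}_{31n} = C \varepsilon_0^{-(\nu + 1)}\, n^{1 - \frac{\nu + 1}{2} + \frac{1}{\beta}}\, (\log n)^{(1 + \delta)\nu - \frac{\nu + 1}{2}}\, h^{-\frac{1 + 4\beta + (\nu + 1)\beta}{2\beta}}. \]

Then, using (H1) and (H2) we get

\[ \mathcal{J}_{31n} \le C \varepsilon_0^{-(\nu + 1)}\, n^{-1 - \frac{\eta(1 + 4\beta + (\nu + 1)\beta) - 2}{2\beta}}\, (\log n)^{(1 + \delta)\nu - \frac{\nu + 1}{2}}. \]

Hence, by the condition on $\beta$ and for any $\eta$ as in (H2), $\mathcal{J}_{31n}$ is the general term of a convergent series.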
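Let us now examine the term $\mathcal{J}_{32n}$. First, we have to calculate

\[ s_n^2 = n \mathrm{Var}(V_1) + \sum_{i \ne j} |\mathrm{Cov}(V_i, V_j)|. \]

We have

\begin{align*}
\mathrm{Var}(V_1) &= \mu^2\, \mathbf{E}\left[ \frac{1}{G^2(Y_1)} K^2\left(\frac{x_k - X_1}{h}\right) H^2\left(\frac{y_j - Y_1}{h}\right) \right] - \mu^2\, \mathbf{E}^2\left[ \frac{1}{G(Y_1)} K\left(\frac{x_k - X_1}{h}\right) H\left(\frac{y_j - Y_1}{h}\right) \right] \\
&=: \mathcal{V}_1 + \mathcal{V}_2.
\end{align*}

Remark that

\[ \mathbf{E}\left[ \frac{1}{G^2(Y_1)} H^2\left(\frac{y_j - Y_1}{h}\right) \,\Big|\, X_1 \right] = \int \frac{1}{G^2(y_1)} H^2\left(\frac{y_j - y_1}{h}\right) f^*(y_1 | X_1)\, dy_1, \]

which is bounded because $H \le 1$ and $G(y_1) \ge G(a_F) > 0$ on the support of $Y_1$. Then

\begin{align*}
\mathcal{V}_1 &= \mu^2\, \mathbf{E}\left[ K^2\left(\frac{x_k - X_1}{h}\right) \mathbf{E}\left[ \frac{1}{G^2(Y_1)} H^2\left(\frac{y_j - Y_1}{h}\right) \,\Big|\, X_1 \right] \right] \\
&\le \frac{C}{G(a_F)}\, \mathbf{E}\left[ K^2\left(\frac{x_k - X_1}{h}\right) \right] \\
&= \frac{C h}{G(a_F)} \int K^2(r)\, v^*(x_k - rh)\, dr.
\end{align*}

Under (K3: ii) and (D1), we have $\mathcal{V}_1 = O(h)$. An analogous development gives $\mathcal{V}_2 = O(h^2)$, which implies $\mathrm{Var}(V_1) = O(h)$.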
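On the one hand, (M1), (K1) and (K2) lead to

\begin{align*}
|\mathrm{Cov}(V_i, V_j)| &= |\mathbf{E}[V_i V_j]| \\
&= \mu^2 \Bigg| \int\!\!\int\!\!\int\!\!\int \frac{1}{G(r)} K\left(\frac{x_k - u}{h}\right) H\left(\frac{y_j - r}{h}\right) \frac{1}{G(t)} K\left(\frac{x_k - s}{h}\right) H\left(\frac{y_j - t}{h}\right) f^*(u, r, s, t)\, du\, dr\, ds\, dt \\
&\qquad - \int\!\!\int \frac{1}{G(r)} K\left(\frac{x_k - u}{h}\right) H\left(\frac{y_j - r}{h}\right) f^*(u, r)\, du\, dr \times \int\!\!\int \frac{1}{G(t)} K\left(\frac{x_k - s}{h}\right) H\left(\frac{y_j - t}{h}\right) f^*(s, t)\, ds\, dt \Bigg| \\
&\le \frac{C}{G^2(a_F)} \int\!\!\int\!\!\int\!\!\int K\left(\frac{x_k - u}{h}\right) H\left(\frac{y_j - r}{h}\right) K\left(\frac{x_k - s}{h}\right) H\left(\frac{y_j - t}{h}\right) \left| f^*(u, r, s, t) - f^*(u, r) f^*(s, t) \right| du\, dr\, ds\, dt.
\end{align*}

Using Assumption (D3) and a change of variable, it follows that

\[ |\mathrm{Cov}(V_i, V_j)| = O(h^4). \tag{30} \]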

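On the other hand, from a result in Bosq [2, p. 22], we have

\[ |\mathrm{Cov}(V_i, V_j)| = O(\alpha(|i - j|)). \tag{31} \]

Then, to evaluate $\sum_{i \ne j} |\mathrm{Cov}(V_i, V_j)|$, the idea is to introduce a sequence of integers $\varphi_n$, the same as in Lemma 5.1, and to use (30) for the nearest and (31) for the farthest integers $i$ and $j$. Then we get

\begin{align*}
\sum_{i \ne j} |\mathrm{Cov}(V_i, V_j)| &= \sum_{0 < |i - j| \le \varphi_n} |\mathrm{Cov}(V_i, V_j)| + \sum_{|i - j| > \varphi_n} |\mathrm{Cov}(V_i, V_j)| \\
&\le C \sum_{0 < |i - j| \le \varphi_n} h^4 + C \sum_{|i - j| > \varphi_n} \alpha(|i - j|) \\
&\le C n \varphi_n h^4 + C n^2 \alpha(\varphi_n).
\end{align*}

By the right-hand side of (H2) and (M3), one has $\sum_{i \ne j} |\mathrm{Cov}(V_i, V_j)| = O(nh)$. So $s_n^2 = O(nh)$.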

Consequently, by taking $r$ and $\varepsilon$ as in (16) and using a Taylor expansion of $\log(1 + x)$, the term $\mathcal{J}_{32n}$ satisfies

\[ \mathcal{J}_{32n} \le C\, n^{\frac{1}{\beta}}\, h^{-\left( \frac{1}{2\beta} + 2 \right)} \exp\left\{ -C' \varepsilon_0^2 \log n \right\}. \]

By using (H2) and (M3), the latter can be made the general term of a summable series. Thus $\sum_{n \ge 1} (\mathcal{J}_{31n} + \mathcal{J}_{32n}) < \infty$. Then, by the Borel-Cantelli lemma, the first term of (29) goes to zero a.s. and, for $n$ large enough, we have $\mathcal{J}_{3n} = O\left( \sqrt{\frac{\log n}{nh}} \right)$, which completes the proof of the lemma. ■



Lemma 5.5 Under assumptions (K3) and (D4) we have

\[ \sup_{x \in \Omega} \sup_{a \le y \le b} \left| \mathbf{E}\left[ \tilde{F}_{1,n}(x, y) \right] - F_1(x, y) \right| = O(h^2), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]

Proof. The bias terms do not depend on the mixing structure. The proof of Lemma 5.5 is similar to that of Lemma 6.2 in Lemdani et al. [15], hence it is omitted. ■

The next lemma gives the uniform convergence with rate of the estimator $v_n(x)$ defined in (7).

Lemma 5.6 Under the assumptions of Lemma 5.1 and condition (D5), we have

\[ \sup_{x \in \Omega} |v_n(x) - v(x)| = O\left( \max\left\{ \sqrt{\frac{\log n}{nh}},\ h^2 \right\} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \]

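Proof. Adapting (7), define

\[ \tilde{v}_n(x) = \frac{\mu}{nh} \sum_{i=1}^{n} \frac{1}{G(Y_i)}\, K\left(\frac{x - X_i}{h}\right). \tag{32} \]

We have

\begin{align*}
\sup_{x \in \Omega} |v_n(x) - v(x)| &\le \sup_{x \in \Omega} |v_n(x) - \tilde{v}_n(x)| + \sup_{x \in \Omega} |\tilde{v}_n(x) - \mathbf{E}[\tilde{v}_n(x)]| + \sup_{x \in \Omega} |\mathbf{E}[\tilde{v}_n(x)] - v(x)| \\
&=: \mathcal{L}_{1n} + \mathcal{L}_{2n} + \mathcal{L}_{3n}.
\end{align*}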
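For the first term, using a framework analogous to that of Lemma 5.3, we get

\[ \mathcal{L}_{1n} = O\left( \sqrt{\frac{\log \log n}{n}} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \tag{33} \]

In addition, by using the same approach as for $\mathcal{I}_{1n}$ in the proof of Lemma 5.1, we can show that, for $n$ large enough,

\[ \mathcal{L}_{2n} = O\left( \sqrt{\frac{\log n}{nh}} \right), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \tag{34} \]

Finally, by a change of variable and a Taylor expansion we get, under (K3) and (D5),

\begin{align*}
\mathbf{E}[\tilde{v}_n(x)] - v(x) &= \mathbf{E}\left[ \frac{\mu}{h\, G(Y_1)} K\left(\frac{x - X_1}{h}\right) \right] - v(x) \\
&= \frac{1}{h} \int K\left(\frac{x - u}{h}\right) v(u)\, du - v(x) \\
&= \frac{h^2}{2} \int r^2 K(r)\, v''(\tilde{x})\, dr
\end{align*}

with $\tilde{x} \in [x - rh, x]$, which yields that

\[ \mathcal{L}_{3n} = O(h^2), \quad \mathbf{P}\text{-a.s. as } n \to \infty. \tag{35} \]

Combining (33), (34) and (35) permits us to conclude the proof. ■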



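Proof of Proposition 3.1 In view of (8), we have the following classical decomposition:

\[ \sup_{x \in \Omega} \sup_{a \le y \le b} |F_n(y|x) - F(y|x)| \le \frac{1}{\gamma - \sup_{x \in \Omega} |v_n(x) - v(x)|} \left\{ \sup_{x \in \Omega} \sup_{a \le y \le b} |F_{1,n}(x, y) - F_1(x, y)| + \gamma^{-1} \sup_{x \in \Omega} \sup_{a \le y \le b} |F_1(x, y)| \sup_{x \in \Omega} |v_n(x) - v(x)| \right\}. \]

Furthermore, we have

\begin{align*}
\sup_{x \in \Omega} \sup_{a \le y \le b} |F_{1,n}(x, y) - F_1(x, y)| &\le \sup_{x \in \Omega} \sup_{a \le y \le b} \left| F_{1,n}(x, y) - \tilde{F}_{1,n}(x, y) \right| \\
&\quad + \sup_{x \in \Omega} \sup_{a \le y \le b} \left| \tilde{F}_{1,n}(x, y) - \mathbf{E}\tilde{F}_{1,n}(x, y) \right| \\
&\quad + \sup_{x \in \Omega} \sup_{a \le y \le b} \left| \mathbf{E}\tilde{F}_{1,n}(x, y) - F_1(x, y) \right|.
\end{align*}

In conjunction with Lemmas 5.3-5.6, we conclude the proof. ■

We now embark on the proof of Theorem 3.1.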

Proof of Theorem 3.1 Let $x \in \Omega$. As $F_n(\cdot|x)$ and $F(\cdot|x)$ are continuous, we have $F(q_p(x)|x) = F_n(q_{p,n}(x)|x) = p$. Then

\begin{align*}
|F(q_{p,n}(x)|x) - F(q_p(x)|x)| &\le |F(q_{p,n}(x)|x) - F_n(q_{p,n}(x)|x)| + |F_n(q_{p,n}(x)|x) - F(q_p(x)|x)| \\
&= |F(q_{p,n}(x)|x) - F_n(q_{p,n}(x)|x)| \\
&\le \sup_{a \le y \le b} |F_n(y|x) - F(y|x)|. \tag{36}
\end{align*}

The consistency of $q_{p,n}(x)$ then follows immediately from Proposition 3.1 in conjunction with the inequality

\[ \sum_{n} \mathbf{P}\left\{ \sup_{x \in \Omega} |q_{p,n}(x) - q_p(x)| \ge \varepsilon \right\} \le \sum_{n} \mathbf{P}\left\{ \sup_{x \in \Omega} \sup_{a \le y \le b} |F_n(y|x) - F(y|x)| \ge \beta \right\}. \]

For the second part, a Taylor expansion of $F(\cdot|\cdot)$ in a neighborhood of $q_p$ implies that

\[ F(q_{p,n}(x)|x) - F(q_p(x)|x) = (q_{p,n}(x) - q_p(x))\, f(q_p^*(x)|x) \tag{37} \]

where $q_p^*$ is between $q_p$ and $q_{p,n}$ and $f(\cdot|x)$ is the conditional density of $Y$ given $X = x$. Then, from the behavior of $F(q_{p,n}(x)|x) - F(q_p(x)|x)$ as $n$ goes to infinity, it is easy to obtain asymptotic results for the sequence $(q_{p,n}(x) - q_p(x))$. By (36) we have

\[ \sup_{x \in \Omega} |q_{p,n}(x) - q_p(x)|\, |f(q_p^*(x)|x)| \le \sup_{x \in \Omega} \sup_{a \le y \le b} |F_n(y|x) - F(y|x)|. \]

The result follows from (D4) and Proposition 3.1. Here we point out that, if $f(q_p(x)|x) = 0$ for some $x \in \Omega$, we should increase the order of the Taylor expansion to obtain the consistency of $q_{p,n}(x)$ (with an adapted rate). ■
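The last step can be made explicit under one additional assumption that is not stated in this form in the text: suppose there is a constant $c > 0$ (notation introduced here) such that $f(q_p^*(x)|x) \ge c$ for all $x \in \Omega$ and all $n$ large enough. Dividing the last inequality by $c$ and applying Proposition 3.1 then gives the rate of Theorem 3.1:

```latex
\[
\sup_{x\in\Omega} |q_{p,n}(x) - q_p(x)|
\le \frac{1}{c}\,\sup_{x\in\Omega}\sup_{a\le y\le b} |F_n(y|x) - F(y|x)|
= O\!\left(\max\left\{\sqrt{\frac{\log n}{nh}},\ h^2\right\}\right),
\quad \mathbf{P}\text{-a.s. as } n \to \infty .
\]
```

This also makes clear why a vanishing conditional density at the quantile requires a higher-order expansion: without a positive lower bound $c$, the df rate does not transfer directly to the quantile.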